Picture for Xiangyang Xue

Xiangyang Xue

Fudan University

MindVoice: Reconstructing Intelligible Speech from Non-invasive Neural Signals with Pretrained Priors

Add code
May 29, 2026
Viaarxiv icon

Afford-VLA: Action-Aligned Visual Planning via Internalized Affordance

Add code
May 22, 2026
Viaarxiv icon

OFlow: Injecting Object-Aware Temporal Flow Matching for Robust Robotic Manipulation

Add code
Apr 20, 2026
Viaarxiv icon

DINO-VO: Learning Where to Focus for Enhanced State Estimation

Add code
Apr 05, 2026
Viaarxiv icon

ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models

Add code
Mar 22, 2026
Viaarxiv icon

OCRA: Object-Centric Learning with 3D and Tactile Priors for Human-to-Robot Action Transfer

Add code
Mar 15, 2026
Viaarxiv icon

DynamicVGGT: Learning Dynamic Point Maps for 4D Scene Reconstruction in Autonomous Driving

Add code
Mar 09, 2026
Viaarxiv icon

Vision-Language Feature Alignment for Road Anomaly Segmentation

Add code
Mar 01, 2026
Viaarxiv icon

Universal Pose Pretraining for Generalizable Vision-Language-Action Policies

Add code
Feb 23, 2026
Viaarxiv icon

EgoSound: Benchmarking Sound Understanding in Egocentric Videos

Add code
Feb 15, 2026
Viaarxiv icon